Overview
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 20091 |
| Missing cells | 4050 |
| Missing cells (%) | 1.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.3 MiB |
| Average record size in memory | 120.0 B |
Variable types
| Numeric | 6 |
|---|---|
| Text | 4 |
| Categorical | 4 |
| DateTime | 1 |
Alerts
| Alert | Type |
|---|---|
| income has 612 (3.0%) missing values | Missing |
| payment_mode has 697 (3.5%) missing values | Missing |
| date has 491 (2.4%) missing values | Missing |
| category has 1084 (5.4%) missing values | Missing |
| stock has 711 (3.5%) missing values | Missing |
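The overview totals can be cross-checked against the per-variable sections of this report. The sketch below sums the per-column missing counts (the five columns listed above, plus five columns that each report 91 missing values further down) and recomputes the percentage. Reading the 120.0 B average record size as 8 bytes per cell is an assumption, not something the report states.

```python
# Per-column missing counts, copied from the variable sections of this report.
missing = {
    "income": 612, "payment_mode": 697, "date": 491, "category": 1084,
    "stock": 711, "transaction_id": 91, "product_id": 91, "amount": 91,
    "product_name": 91, "price": 91,
}
n_rows, n_cols = 20091, 15

total_missing = sum(missing.values())                   # reported: 4050
missing_pct = 100 * total_missing / (n_rows * n_cols)   # reported: 1.3%

# Assumption: the memory figure counts 8 bytes per cell (shallow size).
total_bytes = n_rows * n_cols * 8
bytes_per_record = total_bytes // n_rows                # reported: 120.0 B
total_mib = total_bytes / 2**20                         # reported: 2.3 MiB

print(total_missing, round(missing_pct, 1), bytes_per_record, round(total_mib, 1))
```

Under that assumption, all four overview figures follow from the row count, the column count, and the per-column missing counts.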
Reproduction
| Analysis started | 2026-02-24 09:05:13.923770 |
|---|---|
| Analysis finished | 2026-02-24 09:05:20.113977 |
| Duration | 6.19 seconds |
| Software version | ydata-profiling v4.18.1 |
| Download configuration | config.json |
Variables
customer_id
Real number (ℝ)
| Distinct | 5000 |
|---|---|
| Distinct (%) | 24.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2513.8754 |
| Minimum | 1 |
| Maximum | 5000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 157.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 258 |
| Q1 | 1260 |
| median | 2524 |
| Q3 | 3771 |
| 95-th percentile | 4750 |
| Maximum | 5000 |
| Range | 4999 |
| Interquartile range (IQR) | 2511 |
Descriptive statistics
| Standard deviation | 1444.2631 |
|---|---|
| Coefficient of variation (CV) | 0.57451658 |
| Kurtosis | -1.20165 |
| Mean | 2513.8754 |
| Median Absolute Deviation (MAD) | 1256 |
| Skewness | -0.013086578 |
| Sum | 50506271 |
| Variance | 2085895.9 |
| Monotonicity | Increasing |
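Several of the figures above are simple functions of the others, which gives a quick way to sanity-check the tables. A minimal sketch, using only values copied from the customer_id tables; small differences are expected because the report rounds the mean and standard deviation.

```python
# Figures copied from the customer_id tables in this report.
n = 20091
minimum, q1, q3, maximum = 1, 1260, 3771, 5000
mean, std = 2513.8754, 1444.2631

value_range = maximum - minimum   # reported: 4999
iqr = q3 - q1                     # reported: 2511
cv = std / mean                   # reported: 0.57451658
variance = std ** 2               # reported: 2085895.9
total = mean * n                  # reported: 50506271 (no missing values)

print(value_range, iqr, round(cv, 6), round(variance, 1), round(total))
```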
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1366 | 14 | 0.1% |
| 138 | 13 | 0.1% |
| 605 | 13 | 0.1% |
| 2916 | 13 | 0.1% |
| 4948 | 12 | 0.1% |
| 1093 | 11 | 0.1% |
| 4578 | 11 | 0.1% |
| 4256 | 11 | 0.1% |
| 2684 | 11 | 0.1% |
| 4871 | 11 | 0.1% |
| Other values (4990) | 19971 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 3 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 9 | |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 8 | |
| 7 | 5 | |
| 8 | 5 | |
| 9 | 5 | |
| 10 | 3 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5000 | 6 | |
| 4999 | 4 | |
| 4998 | 5 | |
| 4997 | 1 | < 0.1% |
| 4996 | 1 | < 0.1% |
| 4995 | 4 | |
| 4994 | 4 | |
| 4993 | 3 | |
| 4992 | 1 | < 0.1% |
| 4991 | 5 |
name
Text
| Distinct | 200 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 157.1 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 10.968941 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Arjun Verma |
|---|---|
| 2nd row | Arjun Verma |
| 3rd row | Arjun Verma |
| 4th row | Shaurya Khan |
| 5th row | Anika Verma |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| reddy | 2193 | 5.5% |
| sharma | 2145 | 5.3% |
| iyer | 2052 | 5.1% |
| mehta | 2018 | 5.0% |
| gupta | 1992 | 5.0% |
| singh | 1972 | 4.9% |
| patel | 1968 | 4.9% |
| verma | 1964 | 4.9% |
| khan | 1938 | 4.8% |
| nair | 1849 | 4.6% |
| Other values (20) | 20091 |
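The table above counts lowercased word tokens rather than full names. The sketch below reproduces that kind of count on the five sample rows and then checks the percentages against the full-dataset counts. Splitting on whitespace after lowercasing is an assumed tokenisation, chosen because it matches the figures shown.

```python
from collections import Counter

# The five sample rows shown for `name` in this report.
names = ["Arjun Verma", "Arjun Verma", "Arjun Verma", "Shaurya Khan", "Anika Verma"]

# Assumed tokenisation: lowercase, then split on whitespace.
words = Counter(token for name in names for token in name.lower().split())
print(words.most_common(2))   # [('verma', 4), ('arjun', 3)]

# Full-dataset check using the counts from the table above.
top10 = [2193, 2145, 2052, 2018, 1992, 1972, 1968, 1964, 1938, 1849]
other_values = 20091
total_tokens = sum(top10) + other_values          # two tokens per row
share_reddy = round(100 * 2193 / total_tokens, 1)
print(sum(top10), total_tokens, share_reddy)      # 20091 40182 5.5
```

The ten listed words sum to exactly 20091, one per row, and the remaining 20 values sum to 20091 as well. That is consistent with each name being one of 20 first names followed by one of 10 surnames (200 distinct names), and it explains why the frequencies are computed against 40182 tokens rather than 20091 rows.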
Most occurring characters
| Value | Count | Frequency (%) |
| a | 43986 | |
| (space) | 20091 | 9.1% |
| r | 15844 | 7.2% |
| h | 14101 | 6.4% |
| y | 13355 | 6.1% |
| i | 13096 | 5.9% |
| n | 12930 | 5.9% |
| e | 11277 | 5.1% |
| t | 7056 | 3.2% |
| S | 6958 | 3.2% |
| Other values (20) | 61683 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 220377 |
age
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.627395 |
| Minimum | 18 |
| Maximum | 69 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 157.1 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 30 |
| median | 44 |
| Q3 | 57 |
| 95-th percentile | 67 |
| Maximum | 69 |
| Range | 51 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 15.029731 |
|---|---|
| Coefficient of variation (CV) | 0.34450214 |
| Kurtosis | -1.2120725 |
| Mean | 43.627395 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.01223145 |
| Sum | 876518 |
| Variance | 225.89282 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 43 | 487 | 2.4% |
| 46 | 468 | 2.3% |
| 66 | 462 | 2.3% |
| 20 | 437 | 2.2% |
| 28 | 429 | 2.1% |
| 30 | 426 | 2.1% |
| 38 | 425 | 2.1% |
| 40 | 423 | 2.1% |
| 19 | 422 | 2.1% |
| 62 | 421 | 2.1% |
| Other values (42) | 15691 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 18 | 327 | |
| 19 | 422 | |
| 20 | 437 | |
| 21 | 416 | |
| 22 | 338 | |
| 23 | 373 | |
| 24 | 369 | |
| 25 | 365 | |
| 26 | 328 | |
| 27 | 401 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 69 | 366 | |
| 68 | 411 | |
| 67 | 296 | |
| 66 | 462 | |
| 65 | 412 | |
| 64 | 387 | |
| 63 | 386 | |
| 62 | 421 | |
| 61 | 408 | |
| 60 | 403 |
gender
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 157.1 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.9843711 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Male |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Male | 6805 | 33.9% |
| Other | 6795 | 33.8% |
| Female | 6491 | 32.3% |
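The category counts determine both the frequency column and the length statistics reported for this variable. A small sketch, using only the counts in the table above:

```python
# Category counts copied from the Common Values table for `gender`.
counts = {"Male": 6805, "Other": 6795, "Female": 6491}

n = sum(counts.values())                                      # 20091 rows, none missing
freq = {k: round(100 * v / n, 1) for k, v in counts.items()}  # percentage per category

# Total characters and mean string length follow from the same counts.
total_chars = sum(len(k) * v for k, v in counts.items())      # reported: 100141
mean_len = total_chars / n                                    # reported: 4.9843711

print(n, freq, total_chars, round(mean_len, 7))
```

The three categories are close to evenly split, and the character total agrees with the figure shown under "Most occurring categories".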
Words
| Value | Count | Frequency (%) |
| male | 6805 | |
| other | 6795 | |
| female | 6491 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 26582 | |
| a | 13296 | |
| l | 13296 | |
| M | 6805 | 6.8% |
| O | 6795 | 6.8% |
| t | 6795 | 6.8% |
| h | 6795 | 6.8% |
| r | 6795 | 6.8% |
| F | 6491 | 6.5% |
| m | 6491 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 100141 |
city
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 157.1 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.6996665 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Jaipur |
|---|---|
| 2nd row | Jaipur |
| 3rd row | Jaipur |
| 4th row | Hyderabad |
| 5th row | Surat |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Mumbai | 2109 | 10.5% |
| Bangalore | 2105 | 10.5% |
| Pune | 2071 | 10.3% |
| Hyderabad | 2065 | 10.3% |
| Delhi | 2049 | 10.2% |
| Kolkata | 1993 | 9.9% |
| Jaipur | 1962 | 9.8% |
| Ahmedabad | 1927 | 9.6% |
| Surat | 1923 | 9.6% |
| Chennai | 1887 | 9.4% |
Words
| Value | Count | Frequency (%) |
| mumbai | 2109 | |
| bangalore | 2105 | |
| pune | 2071 | |
| hyderabad | 2065 | |
| delhi | 2049 | |
| kolkata | 1993 | |
| jaipur | 1962 | |
| ahmedabad | 1927 | |
| surat | 1923 | |
| chennai | 1887 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 24061 | |
| e | 12104 | 9.0% |
| u | 8065 | 6.0% |
| r | 8055 | 6.0% |
| i | 8007 | 5.9% |
| d | 7984 | 5.9% |
| n | 7950 | 5.9% |
| l | 6147 | 4.6% |
| b | 6101 | 4.5% |
| h | 5863 | 4.4% |
| Other values (17) | 40266 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 134603 |
income
Real number (ℝ)
Missing
| Distinct | 4835 |
|---|---|
| Distinct (%) | 24.8% |
| Missing | 612 |
| Missing (%) | 3.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 652747.93 |
| Minimum | 100005 |
| Maximum | 9978234 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 157.1 KiB |
Quantile statistics
| Minimum | 100005 |
|---|---|
| 5-th percentile | 125656 |
| Q1 | 241539.5 |
| median | 449146 |
| Q3 | 791307 |
| 95-th percentile | 1638781 |
| Maximum | 9978234 |
| Range | 9878229 |
| Interquartile range (IQR) | 549767.5 |
Descriptive statistics
| Standard deviation | 833133.56 |
|---|---|
| Coefficient of variation (CV) | 1.2763481 |
| Kurtosis | 52.977278 |
| Mean | 652747.93 |
| Median Absolute Deviation (MAD) | 241129 |
| Skewness | 6.2197965 |
| Sum | 1.2714877 × 10^10 |
| Variance | 6.9411152 × 10^11 |
| Monotonicity | Not monotonic |
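Sum and Variance for income are large enough to be shown in scientific notation (of order 10^10 and 10^11). Both can be recovered from the mean, the standard deviation, and the number of non-missing rows. A brief check, using only figures from this section; agreement is approximate because the inputs are rounded.

```python
import math

# Figures copied from the income tables in this report.
n_rows, n_missing = 20091, 612
mean, std = 652747.93, 833133.56

n_valid = n_rows - n_missing   # rows that actually carry an income value
total = mean * n_valid         # reported Sum: about 1.2714877e10
variance = std ** 2            # reported Variance: about 6.9411152e11

print(n_valid, f"{total:.4e}", f"{variance:.4e}")
```

The sum only matches when the 612 missing rows are excluded, so the mean is taken over the 19479 non-missing values.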
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 714967 | 14 | 0.1% |
| 605757 | 13 | 0.1% |
| 184015 | 13 | 0.1% |
| 1028563 | 13 | 0.1% |
| 354431 | 13 | 0.1% |
| 516022 | 12 | 0.1% |
| 260110 | 12 | 0.1% |
| 165852 | 11 | 0.1% |
| 557776 | 11 | 0.1% |
| 125913 | 11 | 0.1% |
| Other values (4825) | 19356 | |
| (Missing) | 612 | 3.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100005 | 10 | |
| 100093 | 1 | < 0.1% |
| 100120 | 3 | < 0.1% |
| 100221 | 2 | < 0.1% |
| 100358 | 6 | |
| 100394 | 4 | < 0.1% |
| 100618 | 9 | |
| 100677 | 4 | < 0.1% |
| 100783 | 6 | |
| 100822 | 6 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 9978234 | 3 | |
| 9908240 | 3 | |
| 9809045 | 3 | |
| 9779897 | 2 | < 0.1% |
| 9777376 | 5 | |
| 9756004 | 6 | |
| 9742256 | 3 | |
| 9649292 | 2 | < 0.1% |
| 9320295 | 2 | < 0.1% |
| 9149570 | 3 |
transaction_id
Text
| Distinct | 20000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 91 |
| Missing (%) | 0.5% |
| Memory size | 157.1 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 20000 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | T002318 |
|---|---|
| 2nd row | T004426 |
| 3rd row | T012020 |
| 4th row | T004924 |
| 5th row | T002934 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| t002318 | 1 | < 0.1% |
| t004426 | 1 | < 0.1% |
| t009674 | 1 | < 0.1% |
| t010681 | 1 | < 0.1% |
| t011573 | 1 | < 0.1% |
| t011822 | 1 | < 0.1% |
| t006657 | 1 | < 0.1% |
| t014641 | 1 | < 0.1% |
| t019666 | 1 | < 0.1% |
| t004453 | 1 | < 0.1% |
| Other values (19990) | 19990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 37999 | |
| T | 20000 | |
| 1 | 18000 | |
| 2 | 8001 | 5.7% |
| 3 | 8000 | 5.7% |
| 8 | 8000 | 5.7% |
| 4 | 8000 | 5.7% |
| 6 | 8000 | 5.7% |
| 9 | 8000 | 5.7% |
| 7 | 8000 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 140000 |
product_id
Text
| Distinct | 1000 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 91 |
| Missing (%) | 0.5% |
| Memory size | 157.1 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P0468 |
|---|---|
| 2nd row | P0810 |
| 3rd row | P0821 |
| 4th row | P0633 |
| 5th row | P0350 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| p0905 | 36 | 0.2% |
| p0428 | 35 | 0.2% |
| p0644 | 33 | 0.2% |
| p0281 | 32 | 0.2% |
| p0136 | 32 | 0.2% |
| p0362 | 31 | 0.2% |
| p0749 | 31 | 0.2% |
| p0119 | 31 | 0.2% |
| p0717 | 31 | 0.2% |
| p0683 | 31 | 0.2% |
| Other values (990) | 19677 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 25847 | |
| P | 20000 | |
| 7 | 6142 | 6.1% |
| 9 | 6067 | 6.1% |
| 4 | 6015 | 6.0% |
| 1 | 6004 | 6.0% |
| 8 | 6002 | 6.0% |
| 3 | 5997 | 6.0% |
| 6 | 5986 | 6.0% |
| 2 | 5980 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
amount
Real number (ℝ)
| Distinct | 4826 |
|---|---|
| Distinct (%) | 24.1% |
| Missing | 91 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1728.4613 |
| Minimum | 100 |
| Maximum | 49912 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 157.1 KiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 177 |
| Q1 | 532.75 |
| median | 1142 |
| Q3 | 2190 |
| 95-th percentile | 4700 |
| Maximum | 49912 |
| Range | 49812 |
| Interquartile range (IQR) | 1657.25 |
Descriptive statistics
| Standard deviation | 2624.8465 |
|---|---|
| Coefficient of variation (CV) | 1.518603 |
| Kurtosis | 138.23852 |
| Mean | 1728.4613 |
| Median Absolute Deviation (MAD) | 724 |
| Skewness | 9.8327234 |
| Sum | 34569226 |
| Variance | 6889818.9 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 161 | 23 | 0.1% |
| 409 | 22 | 0.1% |
| 243 | 22 | 0.1% |
| 127 | 21 | 0.1% |
| 160 | 21 | 0.1% |
| 506 | 21 | 0.1% |
| 114 | 20 | 0.1% |
| 180 | 20 | 0.1% |
| 196 | 20 | 0.1% |
| 133 | 20 | 0.1% |
| Other values (4816) | 19790 | |
| (Missing) | 91 | 0.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 18 | |
| 101 | 13 | |
| 102 | 10 | |
| 103 | 6 | < 0.1% |
| 104 | 3 | < 0.1% |
| 105 | 16 | |
| 106 | 15 | |
| 107 | 8 | |
| 108 | 16 | |
| 109 | 9 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 49912 | 1 | |
| 48978 | 1 | |
| 48705 | 1 | |
| 48145 | 1 | |
| 47861 | 1 | |
| 47694 | 1 | |
| 47281 | 1 | |
| 47071 | 1 | |
| 46411 | 1 | |
| 46364 | 1 |
payment_mode
Categorical
Missing
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 697 |
| Missing (%) | 3.5% |
| Memory size | 157.1 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 7.8112818 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UPI |
|---|---|
| 2nd row | Net Banking |
| 3rd row | Cash |
| 4th row | Cash |
| 5th row | UPI |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Net Banking | 3948 | 19.7% |
| Credit Card | 3918 | 19.5% |
| Cash | 3887 | 19.3% |
| UPI | 3856 | 19.2% |
| Debit Card | 3785 | 18.8% |
| (Missing) | 697 | 3.5% |
Words
| Value | Count | Frequency (%) |
| card | 7703 | |
| banking | 3948 | |
| net | 3948 | |
| credit | 3918 | |
| cash | 3887 | |
| upi | 3856 | |
| debit | 3785 |
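The word counts above follow directly from the category counts once each payment mode is split into tokens. A minimal sketch, assuming lowercase whitespace tokenisation (an assumption chosen because it reproduces the figures shown):

```python
from collections import Counter

# Category counts from the Common Values table (missing rows excluded).
modes = {
    "Net Banking": 3948, "Credit Card": 3918, "Cash": 3887,
    "UPI": 3856, "Debit Card": 3785,
}

words = Counter()
for mode, count in modes.items():
    for token in mode.lower().split():   # assumed tokenisation
        words[token] += count

# Total characters across all non-missing values (spaces included).
total_chars = sum(len(mode) * count for mode, count in modes.items())

print(words["card"], words["net"], words["upi"], total_chars)
```

"card" reaches 7703 because it appears in both Credit Card (3918) and Debit Card (3785). The character total of 151492 matches the figure under "Most occurring categories".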
Most occurring characters
| Value | Count | Frequency (%) |
| a | 15538 | |
| C | 15508 | |
| e | 11651 | 7.7% |
| t | 11651 | 7.7% |
| i | 11651 | 7.7% |
| (space) | 11651 | 7.7% |
| d | 11621 | 7.7% |
| r | 11621 | 7.7% |
| n | 7896 | 5.2% |
| N | 3948 | 2.6% |
| Other values (10) | 38756 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 151492 |
date
Date
Missing
| Distinct | 365 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 491 |
| Missing (%) | 2.4% |
| Memory size | 157.1 KiB |
| Minimum | 2025-01-01 00:00:00 |
| Maximum | 2025-12-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
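The distinct count can be compared with the length of the reported date range. A short check using only the minimum, maximum, and counts above:

```python
from datetime import date

# Range and counts copied from the date tables in this report.
first, last = date(2025, 1, 1), date(2025, 12, 31)
n_rows, n_missing, n_distinct = 20091, 491, 365

days_in_range = (last - first).days + 1     # inclusive day count
n_valid = n_rows - n_missing                # rows with a date
rows_per_day = n_valid / n_distinct         # average rows per calendar day

print(days_in_range, n_valid, round(rows_per_day, 1))   # 365 19600 53.7
```

Since the range spans 365 days and 365 distinct dates are reported, every calendar day of 2025 appears at least once.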
product_name
Text
| Distinct | 1000 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 91 |
| Missing (%) | 0.5% |
| Memory size | 157.1 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.89465 |
| Min length | 9 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Product_468 |
|---|---|
| 2nd row | Product_810 |
| 3rd row | Product_821 |
| 4th row | Product_633 |
| 5th row | Product_350 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| product_905 | 36 | 0.2% |
| product_428 | 35 | 0.2% |
| product_644 | 33 | 0.2% |
| product_281 | 32 | 0.2% |
| product_136 | 32 | 0.2% |
| product_362 | 31 | 0.2% |
| product_749 | 31 | 0.2% |
| product_119 | 31 | 0.2% |
| product_717 | 31 | 0.2% |
| product_683 | 31 | 0.2% |
| Other values (990) | 19677 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 20000 | |
| r | 20000 | |
| o | 20000 | |
| d | 20000 | |
| u | 20000 | |
| c | 20000 | |
| t | 20000 | |
| _ | 20000 | |
| 7 | 6142 | 2.8% |
| 9 | 6067 | 2.8% |
| Other values (8) | 45684 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 217893 |
category
Categorical
Missing
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1084 |
| Missing (%) | 5.4% |
| Memory size | 157.1 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 11 |
| Mean length | 9.9775346 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Computer |
|---|---|
| 2nd row | Electronics |
| 3rd row | Audio |
| 4th row | Computer |
| 5th row | Electronics |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Computer | 4066 | 20.2% |
| Mobile Accessories | 4065 | 20.2% |
| Wearable | 3973 | 19.8% |
| Audio | 3962 | 19.7% |
| Electronics | 2941 | 14.6% |
| (Missing) | 1084 | 5.4% |
Words
| Value | Count | Frequency (%) |
| computer | 4066 | |
| mobile | 4065 | |
| accessories | 4065 | |
| wearable | 3973 | |
| audio | 3962 | |
| electronics | 2941 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 27148 | |
| o | 19099 | 10.1% |
| s | 15136 | 8.0% |
| r | 15045 | 7.9% |
| i | 15033 | 7.9% |
| c | 14012 | 7.4% |
| l | 10979 | 5.8% |
| b | 8038 | 4.2% |
| u | 8028 | 4.2% |
| A | 8027 | 4.2% |
| Other values (11) | 49098 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 189643 |
price
Real number (ℝ)
| Distinct | 778 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 91 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2921.1703 |
| Minimum | 105 |
| Maximum | 50000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 157.1 KiB |
Quantile statistics
| Minimum | 105 |
|---|---|
| 5-th percentile | 235 |
| Q1 | 889 |
| median | 1702 |
| Q3 | 2507 |
| 95-th percentile | 10000 |
| Maximum | 50000 |
| Range | 49895 |
| Interquartile range (IQR) | 1618 |
Descriptive statistics
| Standard deviation | 5693.9289 |
|---|---|
| Coefficient of variation (CV) | 1.9491945 |
| Kurtosis | 43.975718 |
| Mean | 2921.1703 |
| Median Absolute Deviation (MAD) | 813 |
| Skewness | 6.1033453 |
| Sum | 58423405 |
| Variance | 32420827 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 10000 | 1565 | 7.8% |
| 25000 | 209 | 1.0% |
| 50000 | 190 | 0.9% |
| 1665 | 84 | 0.4% |
| 2487 | 76 | 0.4% |
| 1006 | 75 | 0.4% |
| 427 | 75 | 0.4% |
| 2299 | 72 | 0.4% |
| 1268 | 72 | 0.4% |
| 3022 | 72 | 0.4% |
| Other values (768) | 17510 | |
| (Missing) | 91 | 0.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 105 | 22 | |
| 106 | 33 | |
| 108 | 14 | 0.1% |
| 113 | 24 | |
| 116 | 53 | |
| 117 | 22 | |
| 121 | 22 | |
| 125 | 16 | 0.1% |
| 128 | 23 | |
| 129 | 19 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 50000 | 190 | 0.9% |
| 25000 | 209 | 1.0% |
| 10000 | 1565 | |
| 3099 | 40 | 0.2% |
| 3098 | 18 | 0.1% |
| 3097 | 29 | 0.1% |
| 3090 | 24 | 0.1% |
| 3086 | 21 | 0.1% |
| 3080 | 19 | 0.1% |
| 3074 | 22 | 0.1% |
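The tables above show three round prices (10000, 25000, 50000) sitting far above the next-largest value, 3099. The sketch below works out how many rows they cover and checks that this is consistent with the reported 95-th percentile of 10000. It uses only counts from this section; the percentile reasoning is a rough rank argument, not the exact interpolation the profiler may use.

```python
# Figures copied from the price tables in this report.
n_rows, n_missing = 20091, 91
spikes = {10000: 1565, 25000: 209, 50000: 190}

n_valid = n_rows - n_missing                     # 20000 rows with a price
spike_rows = sum(spikes.values())                # rows priced at 10000 or more
share_of_valid = 100 * spike_rows / n_valid      # percent of non-missing rows
above_10000 = spikes[25000] + spikes[50000]      # rows strictly above 10000

# The top 5% of rows (1000 of 20000) must reach into the 10000 group
# but cannot be filled by the rows above 10000 alone.
top_5_percent = 0.05 * n_valid
p95_falls_on_10000 = above_10000 < top_5_percent <= spike_rows

print(n_valid, spike_rows, round(share_of_valid, 1), p95_falls_on_10000)
```

Roughly one row in ten carries one of these three prices, which is why the mean (2921.1703) sits well above the median (1702).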
stock
Real number (ℝ)
Missing
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 711 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.855315 |
| Minimum | -5 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 196 |
| Negative (%) | 1.0% |
| Memory size | 157.1 KiB |
Quantile statistics
| Minimum | -5 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 24 |
| median | 49 |
| Q3 | 75 |
| 95-th percentile | 95 |
| Maximum | 100 |
| Range | 105 |
| Interquartile range (IQR) | 51 |
Descriptive statistics
| Standard deviation | 29.443116 |
|---|---|
| Coefficient of variation (CV) | 0.59057126 |
| Kurtosis | -1.1957909 |
| Mean | 49.855315 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | -0.012968924 |
| Sum | 966196 |
| Variance | 866.89707 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 42 | 386 | 1.9% |
| 74 | 384 | 1.9% |
| 14 | 355 | 1.8% |
| 89 | 335 | 1.7% |
| 70 | 324 | 1.6% |
| 93 | 318 | 1.6% |
| 37 | 308 | 1.5% |
| 49 | 294 | 1.5% |
| 21 | 285 | 1.4% |
| 98 | 282 | 1.4% |
| Other values (91) | 16109 | |
| (Missing) | 711 | 3.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -5 | 196 | |
| 1 | 208 | |
| 2 | 144 | |
| 3 | 247 | |
| 4 | 229 | |
| 5 | 86 | 0.4% |
| 6 | 270 | |
| 7 | 177 | |
| 8 | 213 | |
| 9 | 221 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 132 | |
| 99 | 196 | |
| 98 | 282 | |
| 97 | 123 | 0.6% |
| 96 | 147 | |
| 95 | 214 | |
| 94 | 154 | |
| 93 | 318 | |
| 92 | 258 | |
| 91 | 103 | 0.5% |
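The report counts 196 negative stock values, and the smallest value, -5, also occurs 196 times, so every negative entry is -5; there are no zeros. One possible reading is that -5 is a placeholder rather than a real quantity. That interpretation is an assumption, and the helper `clean_stock` below is hypothetical. The sketch shows one way such values might be set aside, using the stock column of the first ten sample rows shown in the Sample section.

```python
# Stock values from the first ten sample rows in this report (None = missing).
stock = [64.0, None, 72.0, 21.0, -5.0, 3.0, 5.0, 10.0, 50.0, 52.0]


def clean_stock(value):
    """Return None for missing or negative stock; keep valid quantities.

    Assumption: negative stock is a data-entry placeholder, not a real count.
    """
    if value is None or value < 0:
        return None
    return value


cleaned = [clean_stock(v) for v in stock]
n_flagged = sum(1 for v in stock if v is not None and v < 0)

print(cleaned)
print(n_flagged)   # 1
```

Whether to drop, impute, or keep these rows depends on what -5 means in the source system, which the report does not say.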
Interactions
Correlations
| | age | amount | category | city | customer_id | gender | income | payment_mode | price | stock |
|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.001 | 0.002 | 0.041 | -0.007 | 0.040 | -0.022 | 0.001 | 0.001 | -0.010 |
| amount | 0.001 | 1.000 | 0.000 | 0.003 | 0.000 | 0.000 | -0.004 | 0.000 | 0.008 | -0.011 |
| category | 0.002 | 0.000 | 1.000 | 0.000 | 0.014 | 0.003 | 0.000 | 0.000 | 0.383 | 0.090 |
| city | 0.041 | 0.003 | 0.000 | 1.000 | 0.044 | 0.050 | 0.035 | 0.010 | 0.007 | 0.000 |
| customer_id | -0.007 | 0.000 | 0.014 | 0.044 | 1.000 | 0.045 | 0.003 | 0.012 | -0.002 | 0.004 |
| gender | 0.040 | 0.000 | 0.003 | 0.050 | 0.045 | 1.000 | 0.036 | 0.000 | 0.000 | 0.000 |
| income | -0.022 | -0.004 | 0.000 | 0.035 | 0.003 | 0.036 | 1.000 | 0.000 | -0.003 | 0.013 |
| payment_mode | 0.001 | 0.000 | 0.000 | 0.010 | 0.012 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| price | 0.001 | 0.008 | 0.383 | 0.007 | -0.002 | 0.000 | -0.003 | 0.000 | 1.000 | 0.008 |
| stock | -0.010 | -0.011 | 0.090 | 0.000 | 0.004 | 0.000 | 0.013 | 0.000 | 0.008 | 1.000 |
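Almost every entry in the matrix above is close to zero, so it helps to rank the off-diagonal pairs. The sketch below copies the matrix as shown, confirms it is symmetric, and lists the three strongest pairs by absolute value. It does not assume which correlation measure the report used for each pair.

```python
cols = ["age", "amount", "category", "city", "customer_id",
        "gender", "income", "payment_mode", "price", "stock"]

# Correlation matrix copied row by row from the table above.
matrix = [
    [1.000, 0.001, 0.002, 0.041, -0.007, 0.040, -0.022, 0.001, 0.001, -0.010],
    [0.001, 1.000, 0.000, 0.003, 0.000, 0.000, -0.004, 0.000, 0.008, -0.011],
    [0.002, 0.000, 1.000, 0.000, 0.014, 0.003, 0.000, 0.000, 0.383, 0.090],
    [0.041, 0.003, 0.000, 1.000, 0.044, 0.050, 0.035, 0.010, 0.007, 0.000],
    [-0.007, 0.000, 0.014, 0.044, 1.000, 0.045, 0.003, 0.012, -0.002, 0.004],
    [0.040, 0.000, 0.003, 0.050, 0.045, 1.000, 0.036, 0.000, 0.000, 0.000],
    [-0.022, -0.004, 0.000, 0.035, 0.003, 0.036, 1.000, 0.000, -0.003, 0.013],
    [0.001, 0.000, 0.000, 0.010, 0.012, 0.000, 0.000, 1.000, 0.000, 0.000],
    [0.001, 0.008, 0.383, 0.007, -0.002, 0.000, -0.003, 0.000, 1.000, 0.008],
    [-0.010, -0.011, 0.090, 0.000, 0.004, 0.000, 0.013, 0.000, 0.008, 1.000],
]

size = len(cols)
symmetric = all(matrix[i][j] == matrix[j][i] for i in range(size) for j in range(size))

# Upper-triangle pairs, ranked by absolute correlation.
pairs = [(abs(matrix[i][j]), cols[i], cols[j])
         for i in range(size) for j in range(i + 1, size)]
strongest = sorted(pairs, reverse=True)[:3]

print(symmetric)
for value, a, b in strongest:
    print(f"{a} / {b}: {value:.3f}")
```

The only pair that stands out is category and price (0.383), followed at a distance by category and stock (0.090) and city and gender (0.050).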
Missing values
Sample
First rows
| | customer_id | name | age | gender | city | income | transaction_id | product_id | amount | payment_mode | date | product_name | category | price | stock |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | Arjun Verma | 56 | Female | Jaipur | 896150.0 | T002318 | P0468 | 281.0 | UPI | 2025-11-15 | Product_468 | Computer | 1006.0 | 64.0 |
| 1 | 1 | Arjun Verma | 56 | Female | Jaipur | 896150.0 | T004426 | P0810 | 822.0 | Net Banking | 2025-09-08 | Product_810 | Electronics | 10000.0 | NaN |
| 2 | 1 | Arjun Verma | 56 | Female | Jaipur | 896150.0 | T012020 | P0821 | 680.0 | Cash | 2025-12-18 | Product_821 | Audio | 2710.0 | 72.0 |
| 3 | 2 | Shaurya Khan | 32 | Male | Hyderabad | 758372.0 | T004924 | P0633 | 4006.0 | Cash | 2025-10-14 | Product_633 | Computer | 2734.0 | 21.0 |
| 4 | 3 | Anika Verma | 38 | Female | Surat | 184812.0 | T002934 | P0350 | 2645.0 | UPI | 2025-03-06 | Product_350 | Electronics | 25000.0 | -5.0 |
| 5 | 3 | Anika Verma | 38 | Female | Surat | 184812.0 | T004198 | P0939 | 1774.0 | Debit Card | 2025-03-23 | Product_939 | Mobile Accessories | 613.0 | 3.0 |
| 6 | 3 | Anika Verma | 38 | Female | Surat | 184812.0 | T008754 | P0205 | 3322.0 | UPI | 2025-11-10 | Product_205 | Electronics | 294.0 | 5.0 |
| 7 | 3 | Anika Verma | 38 | Female | Surat | 184812.0 | T009173 | P0885 | 1722.0 | UPI | 2025-07-02 | Product_885 | Electronics | 1047.0 | 10.0 |
| 8 | 3 | Anika Verma | 38 | Female | Surat | 184812.0 | T010772 | P0884 | 392.0 | Cash | 2025-05-02 | Product_884 | Mobile Accessories | 2253.0 | 50.0 |
| 9 | 3 | Anika Verma | 38 | Female | Surat | 184812.0 | T010879 | P0854 | 5190.0 | UPI | 2025-12-08 | Product_854 | Mobile Accessories | 532.0 | 52.0 |
Last rows
| | customer_id | name | age | gender | city | income | transaction_id | product_id | amount | payment_mode | date | product_name | category | price | stock |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20081 | 4999 | Aarav Sharma | 19 | Female | Delhi | 678984.0 | T013342 | P0001 | 616.0 | Debit Card | 2025-09-18 | Product_1 | Audio | 1940.0 | 53.0 |
| 20082 | 4999 | Aarav Sharma | 19 | Female | Delhi | 678984.0 | T015424 | P0943 | 319.0 | Cash | 2025-09-25 | Product_943 | Computer | 1387.0 | 37.0 |
| 20083 | 4999 | Aarav Sharma | 19 | Female | Delhi | 678984.0 | T015643 | P0026 | 4377.0 | Credit Card | 2025-06-19 | Product_26 | Audio | 2702.0 | 14.0 |
| 20084 | 4999 | Aarav Sharma | 19 | Female | Delhi | 678984.0 | T016522 | P0223 | 6801.0 | Credit Card | 2025-05-10 | Product_223 | Computer | 2678.0 | 7.0 |
| 20085 | 5000 | Diya Nair | 40 | Male | Jaipur | 7028217.0 | T000041 | P0186 | 295.0 | Net Banking | 2025-08-23 | Product_186 | Audio | 213.0 | 3.0 |
| 20086 | 5000 | Diya Nair | 40 | Male | Jaipur | 7028217.0 | T002479 | P0224 | 1274.0 | UPI | 2025-04-06 | Product_224 | Mobile Accessories | 2296.0 | 48.0 |
| 20087 | 5000 | Diya Nair | 40 | Male | Jaipur | 7028217.0 | T012316 | P0686 | 160.0 | UPI | 2025-09-23 | Product_686 | Audio | 754.0 | 2.0 |
| 20088 | 5000 | Diya Nair | 40 | Male | Jaipur | 7028217.0 | T014907 | P0415 | 6832.0 | Credit Card | 2025-09-29 | Product_415 | Electronics | 1706.0 | 60.0 |
| 20089 | 5000 | Diya Nair | 40 | Male | Jaipur | 7028217.0 | T017816 | P0667 | 756.0 | Net Banking | 2025-03-06 | Product_667 | Wearable | 2179.0 | 24.0 |
| 20090 | 5000 | Diya Nair | 40 | Male | Jaipur | 7028217.0 | T019104 | P0300 | 1544.0 | Net Banking | 2025-12-11 | Product_300 | None | 50000.0 | NaN |